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Abstract — Network coding is known to improve the througliput 
and tlie resilience to losses in most network scenarios. In a 
practical network scenario, however, the accurate modeling of 
the traffic is often too complex and/or infeasible. The goal 
is thus to design codes that perform close to the capacity of 
any network (with arbitrary traffic) efficiently. In this context, 
random linear network codes are known to be capacity-achieving 
while requiring a decoding complexity quadratic in the message 
length. Chunked Codes (CC) were proposed by Maymounkov 
et al. to improve the computational efficiency of random codes 
by partitioning the message into a number of non-overlapping 
chunks. CC can also be capacity-achieving but have a lower 
encoding/decoding complexity at the expense of slower conver- 
gence to the capacity. In this paper, we propose and analyze 
a generalized version of CC called Overlapped Chunked Codes 
(OCC) in which chunks are allowed to overlap^ Our theoretical 
analysis and simulation results show that compared to CC, OCC 
can achieve the capacity with a faster speed while maintaining 
almost the same advantage in computational efficiency. 

I. Introduction 

In the context of information flow over the packet networks, 
network coding is known to generally improve the network's 
throughput and resilience to losses [1]. A packet network can 
be represented as a graph, either directed or undirected, each 
of whose edges corresponds to a link between two network 
nodes. Each link is modeled as a packet erasure channel. 
The flow of the transmissions is called the traffic. Mostly in 
literature, the traffic is modeled by either a stochastic process 
or a deterministic description of the transmission instances. A 
problem of interest is to design codes that achieve the capacity 
of a network with a given traffic model. 

In practice, however, the accurate modeling of the traffic is 
often too complex and/or infeasible [4]; namely, either the 
set of active nodes varies over time or the link capacities 
dynamically change due to the congestion or any other cross 
traffic phenomena. The packets may experience unconstrained 
delays and even be reordered. The network nodes might not be 
aware of successful transmissions and the feedback solution 
might not be appealing; because it may either contribute to 
additional delay or be too difficult to implement. This implies 
the need for ruleless codes in the following sense: each node 
constructs and transmits new coded packets regardless of 
which packets have been received by the destination nodes. 
Hence the code design problem from an information-theoretic 
perspective translates into devising codes that perform close 
to the capacity for any arbitrary traffic. 

'Very recently, we found out that the idea of CC with overlapping chunks 
has also been proposed in [3] independently. Unlike this paper, however, no 
theoretical analysis is presented in [3]. 



Random linear network codes (a.k.a. dense codes) are 
known to be capacity-achieving over the networks with any 
arbitrary traffic. But this comes at the cost of computational 
inefficiency; each node requires 0{k) operations per input 
symbol to encode or recode k symbol-long messages. The 
decoder also requires 0{k^) operations per input symbol 
to invert a k x k matrix as well as 0{k) operations per 
input symbol to apply the inverse matrix to the received 
coded symbols. Therefore, a question from a computational 
complexity point of view is how to design capacity-achieving 
codes that are more efficient. By applying random codes on 
a chunk (a smaller sub-message of the original message), 
the encoding and decoding require a number of operations 
proportional to the size of the chunk which is smaller than k, 
the size of the original message. This is the main idea behind 
the design of Chunked Codes (CC) in which the message is 
split into chunks [4]. Each node decides which chunk(s) to 
transmit at any time instant using a random linear code. The 
code design problem has thus to deal with the following issues: 
how to chunk the message, and how to schedule the chunks 
to be coded and transmitted. 

Maymounkov et al. [2] present a solution for designing CC 
that can provably achieve the capacity for any arbitrary traffic. 
They partition the message into non-overlapping chunks of 
equal size. Each node applies a random linear code to a 
randomly chosen chunk at any point in time. Each chunk has 
a unique content and so has to be decoded separately. By 
random scheduling, however, it may take too long for some 
chunks to be decoded. This reduces the speed of convergence 
to the capacity. 

In this work, we design CC in which chunks are allowed 
to overlap. The decoding of one chunk, in this case, would 
provide some information that can help in decoding the other 
chunks. Consequently, the speed of convergence to capacity is 
increased while preserving almost the same advantage in terms 
of the computational efficiency. In addition to asymptotic 
results, we also demonstrate through finite-length simulations 
that when CC cannot be designed to be capacity-achieving, 
the application of the proposed coding scheme can reduce the 
overhead by about 60% and 29% for the line networks of 
length 1 and 2, respectively. 

II. Problem Setup 

In this paper, we focus on the unicast network problem 
over line networks]^ A line network of length I is a collection 

-More general network scenarios can be analyzed in worst-case through 
the union bound analysis using the results presented here. 



of I links connected in tandem whose node set includes one 
source, one terminal and some internal nodes. Each link is 
assumed to be an erasure channel. The transmitted packets 
may be erased with arbitrary probability and be faced with 
arbitrary delays. The set of successful transmissions is called 
a schedule to which the network nodes are assumed to be blind. 
The schedule describes paths over time on which packets can 
be transmitted through the network. 

The schedule can be modeled by a graph specified as 
follows. For each network node v, the graph includes a set 
of nodes {v, t') so that at each time t', the node v either 
successfully transmits or receives a packet. An edge between 
any two nodes {v,t') and {w,t") is a traffic edge indicating 
a successful transmission initiated at node v at time t' and 
received by node w at time t". Without loss of generality, each 
traffic edge is assumed to have a unit capacity. Furthermore, 
an edge between any two nodes {v,t') and {v,t") represents 
a memory edge with an infinite capacity. Let s and t denote 
the source and terminal nodes, respectively. The capacity of 
the min-cut between (s,0) and (t,oo) is called the capacity 
of the schedule. The set of paths that traverse the same set 
of nodes are referred to as a^ow path. Furthermore, any line 
network consists of only one flow path whose number of paths 
equals the network's capacity. An example of a schedule with 
capacity 4 over a line network of length 3 is given in Figure 1 . 

We suppose that node s is given a message of k information 
symbols, where the symbols are strings of bits. Consider a line 
network with an arbitrary schedule of capacity n. Our problem 
is to design a coding scheme that achieves the capacity of the 
schedule, i.e., the number of information symbols, fc, that are 
successfully transmitted from s to t, asymptotically approaches 
n, as k goes to infinity. We are also interested in schemes with 
low computational complexity. 

III. Codes Analysis 

We start with an overview of the results presented in [2], 
as related to our analysis of the overlapped chunked codes. 

A. Dense Codes 

In a random linear coding scheme, each node transmits 
a coded symbol, i.e., a random linear combination of all 
previously received coded symbols. Associated with any coded 
symbol is a payload vector, i.e., the vector of coefficients 
that represents the mapping between the input symbols and 
the coded symbol. Each payload vector is assumed to be 
transmitted together with its coded symbol as a packet. 

The set of received packets by the terminal with linearly 
independent payload vectors are called innovative. To decode 
k input symbols, k innovative packets are necessary and 
sufficient. By randomly combining packets, however, there 
are packets whose payload vectors are linearly dependent 
on the innovative packets' payload vectors. These are called 
non-innovative. The number of non-innovative packets can be 
investigated through a worst-case analysis. This provides a 
lower-bound on the number of received packets among which 
k packets are innovative. 



One can see intuitively that the probabiUty of generating a 
non-innovative packet increases as the packets travel from s 
to t. That is, each layer of the network adds extra dependency 
among the transmitted packets. Let Qi, i — l,...,l, be a 
matrix of size k x n whose columns are the payload vectors 
of the n packets received by node Vi (s = tjq, t = vi). For 
Qi, all the columns have i.i.d. Bernoulli variables, i.e., the 
number of columns with mutual dependency is zero. As we 
proceed down the layers of the network, however, Qi would 
have some columns that are not mutually independent. We 
denote the collection of these columns as a sub-matrix Vi of 
Qi. We also denote the columns of Qi \ Vi which are not 
linearly independent as a sub-matrix V'^ of Qi. In particular, 
at the terminal node, the number of innovative packets can 
be bounded by the sum of the number of the columns of Vi 
and V'l. To find a lower-bound on the capacity n needed for 
the transmission of k packets, one would need to find upper- 
bounds on the number of columns of Vi and V'l. 

Let M he a k X n matrix whose entries are each uniform 
Bernoulli random variables. The set of columns of M that are 
mutually independent are called dense, and M is called dense 
when all its columns are dense. Let I?(M) denote the number 
of dense columns of M. 

Lemma 1: [2, Lemma 3.8] Applying a Dense Code on a 
flow path with capacity n and length I, ^{Q) is larger than 
n — nog(/n/e) with probability of at least 1 — ej^ 

The term /log(/n/e) is an upper-bound on the number of 
columns of Vi. The number of columns of V'l is also bounded 
above by log 1 /e as the following shows. 

Lemma 2: [2, Lemma 3.5] A k x n dense matrix M fails 
to have full row-rank with probability of at most e, so long as 
n is at last A: + log(l/e). 

Lemma [T] together with Lemma |2] give us an upper-bound 
of the total number of non-innovative packets at the termi- 
nal. This yields the following result for Dense Codes under 
arbitrary schedules. 

Theorem 1: [2, Theorem 3.9] A Dense Code on a flow 
path with capacity n and length I fails to deliver k information 
symbols with probability no more than e, so long as n is at 
least k + l\og{kl/e) + log(l/e) + / + L The encoding and 
decoding costs are each 0(fc)j^ 

B. Chunked Codes 

Chunked Codes operate by dividing the k input symbols into 
q non-overlapping chunks, each of size a = k/q. The size of 
chunks is also known as the aperture size. Each node randomly 
chooses a chunk and transmits a random linear combination 
of received packets pertaining to the chosen chunk. Any 
transmission associated to the chunk oj E [q] is called an uj- 
transmission whose associated packet is an uj-packet. 

Each node Vi successfully decodes k input symbols so long 
as Qi, i.e., the matrix of the received payload vectors by Vi 

^Hereafter, the probability of decoding failure is assumed to be inversely 
polynomial in k, i.e., e = 1/fc'^ for some positive c. 

^The encoding and decoding costs are the number of operations per input 
symbol required for encoding and decoding tasks, respectively. 
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Fig. 1 . A line network of lengtli 3 with a scliedule of capacity 4. 

has rank k. Let be Q, restricted to the columns associated 
with tiJ-transmissions. Each chunk uj is decoded so long as 
Qlj has rank k/q. This means the completion of decoding of 
each chunk uj is independent from that of the other chunks. By 
Lemmas [T| and [2] this depends on 'D{Qi^) and the number of 
w-packets at node Vi. We thus need to determine the number 
of received w-packets by each node. This quantity is a random 
variable because of the random nature of the scheduling and 
hence may deviate from its expectation. This deviation has 
therefore to be studied in a worst-case scenario. We give a 
simple example to explain this. 

Suppose a CC with two chunks on the schedule depicted in 
Figure 1 . Let us name the chunks A and B. Each transmission 
is randomly chosen to be either a ^-transmission or a B- 
transmission. The ^-transmissions and ^-transmissions are 
represented by solid and broken line arrows, respectively. By 
the definition, the min-cut capacity of the schedule restricted 
to the ^-transmissions (or ^-transmissions) equals the number 
of disjoint sets of ^-packets (or S-packets) that form valid 
paths of flow from s to t. For example in Figure 1, the B- 
transmissions form a schedule of capacity 2. Furthermore, the 
capacity of flow of ^-transmissions is 1. The expected number 
of paths by S-transmissions (or ^-transmissions) is 2. (The 
number of disjoint paths is 4 and this is to be divided between 
the two chunks.) Thus, one path is lost and this is because of 
those transmissions that can not be matched with any (on the 
upper links) to form distinct paths of flow from s to t. These 
are called the lost transmissions. 

The above example raises the key points in the analysis 
of CC as follows. First, the number of lost tj-transmissions 
has to be taken into account to provide an estimate of the 
capacity of the flow by ^-transmissions. Then, Lemmas [T] and 
|2] are used to derive an upper-bound of the number of non- 
innovative tiJ-packets at the terminal. Finally, the total number 
of non-innovative packets received by node t is computed by 
a union bound and this completes the analysis. 

Lemma 3: For a CC over a flow path with capacity n and 
length I, for each oj, the w-transmissions form a flow of 
capacity / of at least 

[l-0[{ilVMlog{ln/e)f')).{n/q) 

with probability no less than 1 — e, so long as 

ZVlog- = o(n). (1) 



Lemma [3] is based on bounding the number of lost trans- 
missions]^ From the above, it can be deduced that / asymp- 
totically approaches its expected value, i.e., n/q, as long as 
condition ([T]i holds. The following is then easy to prove. 

Corollary 1: For a CC over a flow path with capacity n 
and length I, ^{Q^) (for each ui) is smaller than n/q — 
l{log (n/g) +log(l/e) +logZ+ 1), with probability at most e. 

Corollary [T] along with Theorem [T] result in the following 
for CC on arbitrary schedules. 

Theorem 2: A CC can deliver k information symbols to 
the terminal over a flow path with capacity n and length I 
with probability of failure no larger than e, so long as n is 
at least k + qllog{kl/e) + glog(l/e) + qlogq + q, provided 
that l^q^log{kl/e) — o{k). Furthermore, the encoding and 
decoding costs are each 0{k/q). 

IV. Further Analysis 

Through the results of Section III, it can be seen that the 
performance of CC is subject to the following factors: (1) 
the number of non-innovative packets, (2) the number of lost 
transmissions and (3) the condition of decoding completion. 

The first results from the random choices of linear com- 
binations. It is not always the best to randomly choose a 
subset of packets to be combined. There might be a solution 
to selectively combine the packets so that the coded packet is 
innovative with the highest probability. This is the case when 
each node has some knowledge of the received packets by 
the destination nodes. Thus, in our setup, there is no way to 
reduce the number of non-innovative packets. 

The second is a result of the fact that each node schedules 
chunks randomly regardless of whether they form valid paths 
of flow. More valid paths might be packed in a given schedule 
by keeping track of the transmissions. But, as long as the 
schedule is assumed to be arbitrary, the worst-case analysis 
in Section III-B is an appropriate approach. Therefore, the 
upper-bound on the number of lost paths cannot be improved. 

Last, but most important, is the decoding completion con- 
dition. Each chunk has to be decoded separately when the 
chunks do not overlap in any symbol, i.e., Q has rank k only 
if each Q^^ has rank k/q. This makes the speed of convergence 
to the capacity very slow. A question is whether one can 
devise a chunking method by which does not necessarily 
have to have rank k/q for Q to have rank k. We answer 
this question in positive by allowing chunks to overlap. This 
relaxes the decoding completion condition resulting in a faster 
speed of convergence. The following gives the intuition behind 
the overlapping chunks. 

Suppose the network is reduced to an erasure channel with 
an unknown rate of packet loss. The following results are 
straightforward, though they are not special cases of the results 
in Section III. (There is no internal node across the network 
and so we do not need to worry about the density loss caused 

^We bring tliis to tlie attention of the reader that Lemma[3]is different from 
[2, Theorem 4.1], though the proofs have generally a similar structure. 



by the sub-matrix Vi of 

Corollary 2: A Dense Code fails to deliver k information 
symbols over an erasure channel with probability of at most e, 
so long as ri > /j + log 1/e. The coding costs are each 0{k). 

Corollary 3: A CC can decode k information symbols from 
n packets received by the terminal with probability of at least 
1 — e, so long as n is larger than k + q log l/e + q log q. The 
encoding and decoding costs are each 0{k/q). 

By Corollary [3] the more the chunks, the better the computa- 
tional efficiency. The Chunked Codes, however, are capacity- 
achieving so long as the number of chunks is taken to be 

(/c/log k). This provides a lower-bound on the computational 
cost and a simple calculation shows that the encoding and 
decoding costs can not be reduced down to O(logfc). 

Recently, Studholme and Blake [6] introduced a class of 
erasure codes, in essence, similar to CC that operate on k 
chunks, each of size a such that any two consecutive chunks 
overlap in all but one symbol. The chunks are scheduled in the 
same way as CC. It has been shown in [6] that for sufficiently 
large a > y/k, this coding scheme achieves the capacity of 
any erasure channel as fast as Dense Codes, i.e., with the 
overhead 0(log/c). This comes from the fact that the matrix 
Q in this case behaves indistinguishably from the Q in the 
dense coding schemes. This is while the Q in the scheme 
of [6] has a structure similar to the Q for CC. The encoding 
and decoding costs are each 0(\/fc) and a simple comparison 
indicates that Chunked Codes having the same computational 
complexity need larger overhead of 0{Vklogk). This shows 
that the rank of Qi is much less dependent on the rank of each 
when the chunks overlap as compared to the case with 
non-overlapping chunks. In the next section, we use this idea 
to design Overlapped Chunked Codes (OCC) which provide 
a better tradeoff between the speed of convergence and the 
coding costs. 

V. Overlapped Chunked Codes 
A. Preliminaries 

Conjecture 1: [6, Conjecture 4.2] Consider a k x n binary 
matrix M whose columns are each chosen independently at 
random with nonzero entries restricted to lie within an aperture 
of sufficiently large size a > \fk. Then M has rank less than 
k with probability of at most 2^("^'^). 

Corollary 4: Let M be a matrix as described above except 
that only q apertures are to be chosen. The set of q apertures 
are a subset of apertures in which any two contiguous apertures 
overlap in 7 symbols. For any 7 > \fk, M fails to have full 
row rank with probability of at most 2~("~'^). 

A Chernoff bound shows the matrix M includes njq 
columns pertaining to any aperture with probability of at least 

1 — e, so long as 

g^log- = o(,i). (2) 

We notice that condition (|2]l is a weaker version of condi- 
tion ([TJ. Corollary |4] thus brings about the following. 

^This seems to have been overlooked in [2] when comparing CC to LT 
Codes [5] over an erasure channel. 



Corollary 5: Let Af be a /c x n binary matrix. Take q 
apertures of size a such that any two contiguous apertures 
overlap in sufficiently large 7 > \fk symbols. Furthermore, 
suppose that each aperture appears mnjq columns. Then, M 
does not have rank k with probability of at most 2~("~'^)+'°s<^, 
so long as condition Q is met. 

B. Analysis 

Suppose that the input message of size k is partitioned into 
q chunks, each of size a, and each node randomly chooses a 
chunk to operate on by a random linear code. Unlike CC of [2], 
in the proposed chunking scheme any two contiguous chunks 
overlap hy j — a ~ k/q symbols in an end-around fashion. 
Furthermore, in order to ensure that all the input symbols are 
covered equally by the chunks, (a — 7) has to be an integer 
divisor of a. For the simplicity of exposition, we consider 

for any divisor t of a (1 < r < a). Note that r = 1 implies 
that there is no overlap between chunks and for any t > 1, 
the overlap is increased as t increases so that for t ^ a, any 
two contiguous chunks overlap in all the symbols but one. 

Now suppose that an OCC with the aperture size a and 
overlap 7 is applied to a flow path of capacity n and length /. 
By Corollary [T] the number of dense columns associated with 
each chunk oj in Qi is at least / — I log(///e), with probability 
no less than 1 — e, where / denotes the capacity of the flow of 
w-transmissions. The concentration theorem then implies that 
/ is about n/q and this fails with arbitrarily small probability 
if the number of chunks has been chosen with care, i.e., if 
condition ([T]) holds. Thus for any t > 1, as long as 7 > Vk, 
Corollary [5] applies. 

Theorem 3: An OCC with q chunks of size a and overlap 
7 > y/k fails to deliver k information symbols on a flow path 
with capacity n and length I with probability no more than e, 
so long as 

n> k + qllogkl/e + ql + log 1/e + 1, 
provided that 

I'^q^ log{kl/e) = o(fc) (3) 

and 

T^o{a/il^qlogikl/e))). (4) 

Moreover, the encoding and decoding costs are each 0{a). 

Let T* be the smallest divisor of a such that 7 > Vk. For 
now, the aperture size is assumed to be chosen so that such 
T* does exist. For any r > r*, 7 > y/k, and so the above 
result holds. For a given aperture size, however, the larger the 
T, the more are the chunks, and by Theorem [3] the slower is 
the speed of convergence. We resolve this by setting r = r*. 

Now suppose there is no such r* for a given a, but 
condition ([TJ still holds for some t. It can be easily seen 
that Ql has the structure as specified in Corollary |4] except 
that 7 < \/k. To the best of our knowledge, the rank property 
of any such matrix with r 7^ 1 is an open problem. But at 



Fig. 2. The performance comparison of Dense Codes (DC), CC and OCC 
over line networks of lengtli 4. 

the very least, the rank property of the class of such matrices 
with T = 1 can be thought of as a lower-bound for any such 
matrix with t 

Theorem 4: An OCC with q chunks of size a and overlap 
7 < \fk can successfully transmit fc information symbols 
over a line network of length I with an arbitrary schedule 
of capacity of at least k + ql log kl/e + k, where k < 
q log l/e + q log q + q, given that conditions (|3]l and Q are 
met 1^ Moreover, the coding costs are each 0{a). 

C. Concluding Remarks 

Here we compare CC to OCC, both for arbitrary schedules 
over line networks. 

Remark 1: The speed of convergence in both coding 
schemes is dominated by the largest term which is the same. 
For finite-length codes, however, even the smallest term does 
make a difference as shown in the simulations. 

Remark 2: For a fixed number of chunks, the overlapped 
chunked coding scheme is capacity-achieving with larger 
speed of convergence. This however comes at the cost of 
slightly increasing the computational complexity. (The aper- 
ture size needs to become larger to provide overlap between 
the chunks.) That is, OCC can provide a better tradeoff 
between the speed of convergence and the coding costs. For a 
given aperture size, however, OCC asymptotically achieve the 
capacity with lower speed when compared to CC. 

Remark 3: Regarding the complexity considerations in 
practical scenarios, suppose that the aperture size is bounded 
above so that none of the two coding schemes is capacity- 
achieving with an asymptotically vanishing overhead. OCC 
can be provably shown to achieve the capacity faster than CC 
(but the overhead per symbol does not vanish with k) with the 
same encoding/decoding complexity. 

VI. Simulation Results 

In this section, the performance of the coding schemes over 
the line networks are studied through simulations. We consider 
networks with length I e {1,2,4}. The networks are each 
simulated with randomly generated schedules of capacity n, 

'The upper bound on k is tight for r = 1, though it becomes looser as the 
value of T increases. 



Fig. 3. The performance comparison of CC and OCC with a = 16 over 
line networks of length 1 and 2. 

for integers n S {1024, 1984}. The number of information 
symbols, k, is set at 1024 and each symbol is assumed to be 
a binary random variable. We are interested in the probability 
of successful decoding, i.e., the event of recovering the k 
information symbols at the terminal, as a function of the 
network capacity. We would also like to investigate how this 
function changes with I, the number of chunks q, and the 
overlap 7. For different values of n and /, we apply the coding 
schemes to random schedules until we have 100 successful 
decoding events. 

In the first experiment, we set q = 2 or 4 for CC and 4 
for OCC. This is to stay faithful to conditions ^ and Q. 
Figure |2] shows the results for I = 4. The asymptotic upper 
bounds are also given for reference. In general, our results for 
different values of I show that OCC is always superior to CC 
(this can also be seen in Figure[2]i. We notice that it should not 
be surprising that the finite-length results in Figure |2] surpass 
the asymptotic upper bounds, as the former is an average value 
while the latter is obtained based on a worst-case analysis. 

In the second experiment, fixing the computational com- 
plexity (for a fixed aperture size, e.g., a ~ 16), we study the 
performance of OCC with different overlaps. This will result 
in larger values of q, too large to comply with conditions ([3]) 
and We set g = 64 for CC and q e {128, 256, 512, 1024} 
for OCC. Figure |3] shows the advantage of OCC in reducing 
the overhead. It can also be seen (in our simulations) that there 
is always an optimal 7 7^ which maximizes the probability 
of successful decoding. 
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